Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia

نویسندگان

  • Steven D Brown
  • Shilpa Nagaraju
  • Sagar Utturkar
  • Sashini De Tissera
  • Simón Segovia
  • Wayne Mitchell
  • Miriam L Land
  • Asela Dassanayake
  • Michael Köpke
چکیده

BACKGROUND Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. RESULTS A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Paloindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (as an additional hydrogenase and the absence of a phophoenolpyruvate synthase) and substrate utilization pathway (mannose and aromatics utilization) that might explain phenotypic differences between C. autoethanogenum and C. ljungdahlii. CONCLUSIONS Single molecule sequencing will be increasingly used to produce finished microbial genomes. The complete genome will facilitate comparative genomics and functional genomics and support future comparisons between Clostridia and studies that examine the evolution of plasmids, bacteriophage and CRISPR systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequence data for Clostridium autoethanogenum using three generations of sequencing technologies

During the past decade, DNA sequencing output has been mostly dominated by the second generation sequencing platforms which are characterized by low cost, high throughput and shorter read lengths for example, Illumina. The emergence and development of so called third generation sequencing platforms such as PacBio has permitted exceptionally long reads (over 20 kb) to be generated. Due to read l...

متن کامل

Genome editing of Clostridium autoethanogenum using CRISPR/Cas9

BACKGROUND Impactful greenhouse gas emissions abatement can now be achieved through gas fermentation using acetogenic microbes for the production of low-carbon fuels and chemicals. However, compared to traditional hosts like Escherichia coli or yeast, only basic genetic tools exist for gas-fermenting acetogens. To advance the process, a robust genetic engineering platform for acetogens is essen...

متن کامل

‘BALANCING AND SEQUENCING’ VERSUS ‘ONLY BALANCING’ IN MIXED MODEL U-LINE ASSEMBLY SYSTEMS: AN ECONOMIC ANALYSIS

With the growth in customers’ demand diversification, mixed-model U-lines (MMUL) have acquired increasing importance in the area of assembly systems. There are generally two different approaches in the literature for balancing such systems. Some researchers believe that since the types of models can be very diverse, a balancing approach without simultaneously sequencing of models will not yield...

متن کامل

Developing oncolytic Herpes simplex virus type 1 through UL39 knockout by CRISPR-Cas9

Objective(s): Oncolytic Herpes simplex virus type 1 (HSV-1) has emerged as a promising strategy for cancer therapy. However, development of novel oncolytic mutants has remained a major challenge owing to low efficiency of conventional genome editing methods. Recently, CRISPR-Cas9 has revolutionized genome editing.Materials and Methods: I...

متن کامل

Solving Single Machine Sequencing to Minimize Maximum Lateness Problem Using Mixed Integer Programming

Despite existing various integer programming for sequencing problems, there is not enoughinformation about practical values of the models. This paper considers the problem of minimizing maximumlateness with release dates and presents four different mixed integer programming (MIP) models to solve thisproblem. These models have been formulated for the classical single machine problem, namely sequ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2014